ATTORNEY DOCKET 
064747.1015 



2 



PATENT APPLICATION 

10/826,959 



The Claims: 

1. (Currently Amended) A method comprising: 

determining that one of a plurality of nodes has failed, each node comprising a 
switching fabric integrat e d onto a board and one or more processors integrat e d onto the 
board; and 

removing the failed node from a virtual list of nodes, the virtual list comprising one 
logical entry for each of the plurality of nodes. 

determining that one of a plurality of nodes has failed, each node comprising a 
switching fabric integrated onto a board and one or more processors integrated onto the 
board; 

removing the failed node from a virtual list of nodes, the virtual list comprising one 
logical entry for each of the plurality of nodes; 

determining that at least a portion of a job was being executed on the failed node; 
terminating at least the portion of the job; 

determining that the job was associated with a subset of the plurality of nodes; and 
deallocating the subset of nodes from the job. 

2-3 (Canceled) 

4. (Currently Amended) The method of Claim 3, Claim L, each entry of the 
virtual list comprising a node status and the method further comprising changing the status of 
each of the subset of nodes to "available." 

5. (Currently Amended) The method of Claim 3, Claim L, further comprising: 
determining dimensions of the terminated job based on one or more job parameters 

and an associated policy; 

dynamically allocating a second subset of the plurality of nodes to the terminated job 
based on the determined dimensions; and 

executing the terminated job on the allocated second subset. 

6. (Original) The method of Claim 5, the second subset comprising a 
substantially similar set of nodes to the first subset. 
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7. (Previously Presented) The method of Claim 5, wherein dynamically 
allocating the second subset comprises: 

determining an optimum subset of nodes from a topology of unallocated nodes; and 
allocating the optimum subset. 

8. (Previously Presented) The method of Claim 1, further comprising: 
locating a replacement node for the failed node; and 

updating the logical entry of the failed node with information on the replacement 

node. 

9. (Currently Amended) The method of Claim 1, wherein determining that one 
of the plurality of nodes has failed comprises determining that a repeating communication 
has not been received from the failed node. 

10. (Currently Amended) The method of Claim 1, wherein determining that one 
of the plurality of nodes has failed is accomplished comprises determining through polling 
that one of the plurality of nodes has failed . 
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11. (Currently Amended) Software encoded in one or more computer-readable 
tangible media and when executed operable to: 

determine that on e of a plurality of nodes has failed, each node comprising a 
switching fabric integrat e d onto a board and one or more processors int e grated onto the 
board; and 

remove the failed node from a virtual list of nodes, th e virtual list comprising on e 
logical entry for each of th e plurality of nod e s. 

determine that one of a plurality of nodes has failed, each node comprising a 
switching fabric integrated onto a board and one or more processors integrated onto the 
board; 

remove the failed node from a virtual list of nodes, the virtual list comprising one 
logical entry for each of the plurality of nodes; 

determine that at least a portion of an job was being executed on the failed node; 
terminate at least the portion of the job; 

determine that the job was associated with a subset of the plurality of nodes; and 
deallocate the subset of nodes from the job. 

12-13 (Canceled) 

14. (Currently Amended) The software of Claim 13, Claim 1L each entry of the 
virtual list comprising a node status and the software further operable to change the status of 
each of the subset of nodes to "available." 

15. (Currently Amended) The software of Claim 13, Claim LL further operable 

to: 

determine dimensions of the terminated job based on one or more job parameters and 
an associated policy; 

dynamically allocate a second subset of the plurality of nodes to the terminated job 
based on the determined dimensions; and 

execute the terminated job on the allocated second subset. 
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16. (Original) The software of Claim 15, the second subset comprising a 
substantially similar set of nodes to the first subset. 

17. (Currently Amended) The software of Claim 15, wherein the software is 
operable to dynamically allocate the second subset comprises software operable to: 

determine an optimum subset of nodes from a topology of unallocated nodes; and 
allocate the optimum subset. 

18. (Previously Presented) The software of Claim 1 1, further operable to: 
locate a replacement node for the failed node; and 

update the logical entry of the failed node with information on the replacement node. 

19. (Currently Amended) The software of Claim 11, wherein the software being 
operable to determine that one of the plurality of nodes has failed comprises the software 
being operable to determine that a repeating communication has not been received from the 
failed node. 

20. (Currently Amended) The software of Claim 11, wherein the software being 
operable to determine that one of the plurality of nodes has failed is accomplished comprises 
the software being operable to determine through polling that one of the plurality of nodes 
has failed . 
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21. (Currently Amended) A system comprising: 

a plurality of nodes, each node comprising a switching fabric integrated onto a board 
and one or more processors integrated onto the board; and 
a management node operable to: 

determine that one of the plurality of nodes has failed, each node comprising 
an integrated fabric; and 

remove the failed node from a virtual list of nodes, th e virtual list comprising 
one logical e ntry for each of the plurality of nodes. 

determine that one of the plurality of nodes has failed; 

remove the failed node from a virtual list of nodes, the virtual list comprising 
one logical entry for each of the plurality of nodes; 

determine that at least a portion of an job was being executed on the failed 

node; 

terminate at least the portion of the job; 

determine that the job was associated with a subset of the plurality of nodes; 

and 

deallocate the subset of nodes from the job. 
22-23 (Canceled) 

24. (Currently Amended) The system of Claim 23, Claim 2L each entry of the 
virtual list comprising a node status and the management node further operable to change the 
status of each of the subset of nodes to "available." 

25. (Currently Amended) The system of Claim 23, Claim 2L the management 
node being further operable to: 

determine dimensions of the terminated job based on one or more job parameters and 
an associated policy; 

dynamically allocate a second subset of the plurality of nodes to the terminated job 
based on the determined dimensions; and 

execute the terminated job on the allocated second subset. 
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26. (Original) The system of Claim 25, the second subset comprising a 
substantially similar set of nodes to the first subset. 

27. (Currently Amended) The system of Claim 25, wherein the management node 
being operable to dynamically allocate the second subset comprises the management node 
being operable to: 

determine an optimum subset of nodes from a topology of unallocated nodes; and 
allocate the optimum subset. 

28. (Currently Amended) The system of Claim 21, the management node being 
further operable to: 

locate a replacement node for the failed node; and 

update the logical entry of the failed node with information on the replacement node. 

29. (Currently Amended) The system of Claim 21, wherein the management node 
bein g operable to determine that one of the plurality of nodes has failed comprises the 
management node being operable to determine that a repeating communication has not been 
received from the failed node. 

30. (Currently Amended) The system of Claim 21, wherein the management node 
is operable to determine through polling that one of the plurality of nodes has failed is 
accomplished through polling . 
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3 1 . (New) A method comprising: 

determining that one of a plurality of node Is has failed, each node comprising: 

at least two first processors operable to communicate with each other via a 
direct link between them, the first processors integrated onto a single first 
motherboard; and 

a first switch integrated onto the single first motherboard, the first processors 
communicably coupled to the first switch, the first switch operable to communicably 
couple the first processors to at least eight second motherboards each comprising at 
least two second processors integrated onto the second motherboard and a second 
switch integrated onto the second motherboard operable to communicably couple the 
second processors to the first motherboard and at least seven third motherboards each 
comprising at least two third processors integrated onto the third motherboards and a 
third switch integrated onto the third motherboards, the first processors operable to 
communicate with particular second processors on a particular second motherboard 
via the first switch and the second switch on the particular second motherboard, the 
first processors operable to communicate with particular third processors on a 
particular third motherboard via the first switch, a particular second switch on a 
particular second motherboard between the first motherboard and the particular third 
motherboard, and the third switch on the particular third motherboard; 
removing the failed node from a virtual list of nodes, the virtual list comprising one 
logical entry for each of the plurality of nodes; 

determining that at least a portion of a job was being executed on the failed node; 
terminating at least the portion of the job; 

determining that the job was associated with a subset of the plurality of nodes; and 
deallocating the subset of nodes from the job. 
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